molecular inverse problem
Deep imitation learning for molecular inverse problems
Many measurement modalities arise from well-understood physical processes and result in information-rich but difficult-to-interpret data. Much of this data still requires laborious human interpretation. This is the case in nuclear magnetic resonance (NMR) spectroscopy, where the observed spectrum of a molecule provides a distinguishing fingerprint of its bond structure. Here we solve the resulting inverse problem: given a molecular formula and a spectrum, can we infer the chemical structure? We show for a wide variety of molecules we can quickly compute the correct molecular structure, and can detect with reasonable certainty when our method cannot. We treat this as a problem of graph-structured prediction, where armed with per-vertex information on a subset of the vertices, we infer the edges and edge types. We frame the problem as a Markov decision process (MDP) and incrementally construct molecules one bond at a time, training a deep neural network via imitation learning, where we learn to imitate a subisomorphic oracle which knows which remaining bonds are correct. Our method is fast, accurate, and is the first among recent chemical-graph generation approaches to exploit per-vertex information and generate graphs with vertex constraints. Our method points the way towards automation of molecular structure identification and potentially active learning for spectroscopy.
Reviews: Deep imitation learning for molecular inverse problems
The paper is clearly written and motivates the interesting application of finding molecular structures given a spectrum well. The structure of the ms could be improved, since there are some distracting jumps between method, experiments and related work. In particular, the evaluation (Sec 4) could be described in more detail and can be confusing at the first reading. For example, the threshold was only mentioned once before and it could be stated again, that it applies to the spectrum, not the geometry. Here, the paper could also benefit from giving an overview of the training and evaluation procedure, e.g. in a flow chart.
Reviews: Deep imitation learning for molecular inverse problems
The paper studies a problem of predicting the molecular structured given its NMR spectrum and the molecular formula, through deep imitation learning. The reviewers find the topic important for cheminformatics and the proposed method relevant and potentially impactful. The write-up of the paper should be improved.
Deep imitation learning for molecular inverse problems
Many measurement modalities arise from well-understood physical processes and result in information-rich but difficult-to-interpret data. Much of this data still requires laborious human interpretation. This is the case in nuclear magnetic resonance (NMR) spectroscopy, where the observed spectrum of a molecule provides a distinguishing fingerprint of its bond structure. Here we solve the resulting inverse problem: given a molecular formula and a spectrum, can we infer the chemical structure? We show for a wide variety of molecules we can quickly compute the correct molecular structure, and can detect with reasonable certainty when our method cannot.
Deep imitation learning for molecular inverse problems
Many measurement modalities arise from well-understood physical processes and result in information-rich but difficult-to-interpret data. Much of this data still requires laborious human interpretation. This is the case in nuclear magnetic resonance (NMR) spectroscopy, where the observed spectrum of a molecule provides a distinguishing fingerprint of its bond structure. Here we solve the resulting inverse problem: given a molecular formula and a spectrum, can we infer the chemical structure? We show for a wide variety of molecules we can quickly compute the correct molecular structure, and can detect with reasonable certainty when our method cannot.